Speech Recognition of South African English Accents

نویسنده

  • Herman Kamper
چکیده

Declaration By submitting this thesis electronically, I declare that the entirety of the work contained therein is my own, original work, that I am the sole author thereof (save to the extent explicitly otherwise stated), that reproduction and publication thereof by Stellenbosch University will not infringe any third party rights and that I have not previously in its entirety or in part submitted it for obtaining any qualification. Summary Several accents of English are spoken in South Africa. Automatic speech recognition (ASR) systems should therefore be able to process the different accents of South African English (SAE). In South Africa, however, system development is hampered by the limited availability of speech resources. In this thesis we consider different acoustic modelling approaches and system configurations in order to determine which strategies take best advantage of a limited corpus of the five accents of SAE for the purpose of ASR. Three acoustic modelling approaches are considered: (i) accent-specific modelling, in which accents are modelled separately; (ii) accent-independent modelling, in which acoustic training data is pooled across accents; and (iii) multi-accent modelling , which allows selective data sharing between accents. For the latter approach, selective sharing is enabled by extending the decision-tree state clustering process normally used to construct tied-state hidden Markov models (HMMs) by allowing accent-based questions. In a first set of experiments, we investigate phone and word recognition performance achieved by the three modelling approaches in a configuration where the accent of each test utterance is assumed to be known. Each utterance is therefore presented only to the matching model set. We show that, in terms of best recognition performance, the decision of whether to separate or to pool training data depends on the particular accents in question. Multi-accent acoustic modelling, however, allows this decision to be made automatically in a data-driven manner. When modelling the five accents of SAE, multi-accent models yield a statistically significant improvement of 1.25% absolute in word recognition accuracy over accent-specific and accent-independent models. In a second set of experiments, we consider the practical scenario where the accent of each test utterance is assumed to be unknown. Each utterance is presented simultaneously to a bank of recognisers, one for each accent, running in parallel. In this setup, accent identification is performed implicitly during the speech recognition process. A system employing multi-accent acoustic models in this parallel configuration is shown to achieve slightly improved performance relative to …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acoustic modelling of English-accented and Afrikaans-accented South African English

In this paper we investigate whether it is possible to combine speech data from two South African accents of English in order to improve speech recognition in any one accent. Our investigation is based on Afrikaans-accented English and South African English speech data. We compare three acoustic modelling approaches: separate accent-specific models, accentindependent models obtained by straight...

متن کامل

Nguni and Sotho varieties of South African English - distant cousins or twins?

It is well established that accent can have a detrimental effect on the performance of automatic speech recognition (ASR) systems. While accents are usually classified in terms of a speaker’s mother tongue, it remains to be determined if and when this linguistic classification is appropriate for the development of ASR technology. This study focuses on South African English as produced by mother...

متن کامل

Multi-Accent Speech Recognition of Afrikaans, Black and White Varieties of South African English

In this paper we investigate speech recognition performance of systems employing several accent-specific recognisers in parallel for the simultaneous recognition of multiple accents. We compare these systems with oracle systems, in which test utterances are presented to matching accent-specific recognisers, and with accent-independent systems, in which acoustic and language model training data ...

متن کامل

Data-driven phonetic comparison and conversion between south african, british and american English pronunciations

We analyse pronunciations in American, British and South African English pronunciation dictionaries. Three analyses are perfomed. First the accuracy is determined with which decision tree based grapheme-to-phoneme (G2P) conversion can be applied to each accent. It is found that there is little difference between the accents in this regard. Secondly, pronunciations are compared by performing pai...

متن کامل

Modeling the acoustic differences between L1 and L2 speech: the short vowels of africaans and south-african English

The acoustic differences between Afrikaans and South African English, spoken as first (L1) and second (L2) language, are measured for nine short vowels. The spoken language data base of 22 male speakers, collected for comparative studies, is described. The features used in an initial comparison of the isolated vowels and vowels in CVC words are the first three formant values and ratios. Signifi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012